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Rosenberg 


Abstract 


The  power  of  Butterfly-type  networks  relative  to  other  proposed  multicomputer  interconnection  networks  is 
studied,  by  considering  how  efficiently  the  Butterfly  can  simulate  the  other  networks.  Simulation  is 
represented  formally  via  graph  embeddings,  so  the  topic  here  becomes:  How  efficiently  can  one  embed 
the  graph  underlying  a  given  network  in  the  graph  undertying  the  Butterfly  network?  The  efficiency  of  an 
embedding  of  a  graph  G  in  a  graph  H  is  measured  in  terms  of:  the  dilation,  or,  the  maximum  amount  that 
any  edge  of  G  is  “stretched"  by  the  embedding;  the  expansion,  or,  the  ratio  of  the  number  of  vertices  of  H 
to  the  number  of  vertices  of  G.>Three  general  results  about  embeddings  in  Butterfly-type  graphs  are 
established  here,  that  expose  a  Isumber  of  simulations  by  Butterfly-type  networks,  which  are  optimal  (to 
within  constant  factors):  (1 )  Any  complete  binary  tree  can  be  embedded  in  a  Butterfly  graph,  with 
simultaneous  dlation  0(1)  and  expansion  0(1).  (2)  Any  n-vertex  graph  having  a  /2-bffurcator  of  size  S  = 
n( log  n)  can  be  embedded  in  a  Butterfly  graph  with  simultaneous  dlation  0(log  S)  and  expansion  0(1 ).  (3) 
Any  embedding  of  a  planar  graph  G  in  a  Butterfly  graph  must  have  dlation  0{  [log  E  ( G)]/*  ( G)>:  S  ( G)  is  the 
size  of  the  smallest  1/3-2/3  vertex-separator  of  G;  t(G)  is  the  size  of  G’s  largest  interior  face.  Corollaries 
include:  (a)  The  n-vertex  X-tree  can  be  embedded  in  the  Butterfly  with  simultaneous  dlation  0(log  log  n) 
and  expansion  0(1);  no  embedding  yields  smaller  dlation.  Independent  of  expansion,  (b)  Every 
embedding  of  the  n  xn  mesh  in  th&Butterfly  has  dlation  fl(log  n);  any  expansion-O(i)  embedding  of  the 
mesh  in  the  Butterfly  achieves  this  daSTton^These  results,  which  extend  to  Butterfly-like  graphs  such  as  the 
Cube-Connected  Cycles  and  Benes  networks,  supply  the  first  examples  of  graphs  that  can  be  embedded 
more  efficiently  in  the  Hypercube  than  in  the  Butterfly.  d,/  , .  - 
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Abstract 


The  power  of  Butterfly-type  networks  relative  to  other  proposed  multicomputer 
interconnection  networks  is  studied,  by  considering  how  efficiently  the  Butterfly 
can  simulate  the  other  networks.  Simulation  is  represented  formally  via  graph 
embeddings,  so  the  topic  here  becomes:  How  efficiently  can  one  embed  the  graph 
underlying  a  given  network  in  the  graph  underlying  the  Butterfly  network?  The 
efficiency  of  an  embedding  of  a  graph  G  in  a  graph  H  is  measured  in  terms  of: 
the  dilation ,  or.  the  maximum  amount  that  any  edge  of  G  is  “stretched”  by  the 
embedding;  the  expansion ,  or,  the  ratio  of  the  number  of  vertices  of  H  to  the 
number  of  vertices  of  G.  Three  general  results  about  embeddings  in  Butterfly-type 
graphs  are  established  here,  that  expose  a  number  of  simulations  by  Butterfly-type 
networks,  which  are  optimal  (to  within  constant  factors):  (1)  Any  complete  binary 
tree  can  be  embedded  in  a  Butterfly  graph,  with  simultaneous  dilation  0(1)  and 
expansion  0(1).  (2)  Any  n-vertex  graph  having  a  \/2-bifurcat.or  of  size  S  =  n(log  />) 
ran  be  embedded  in  a  Butterfly  graph  with  simultaneous  dilation  O(logS)  and 
expansion  0(1).  (3)  Any  embedding  of  a  planar  graph  G  in  a  Butterfly  graph 
must  have  dilation  ft  Z(G)  is  the  size  of  the  smallest  1/3-2/3  vertex- 

separator  of  G\  <fr(G)  is  the  size  of  O’s  largest  interior  face.  Corollaries  include:  (a) 
The  n-vertex  X-tree  can  be  embedded  in  the  Butterfly  with  simultaneous  dilation 
0( log  log  n)  and  expansion  0(1);  no  embedding  yields  smaller  dilation,  independent 
of  expansion,  (b)  Every  embedding  of  the  n  x  n  mesh  in  the  Butterfly  has  dilation 
fi(logn);  any  expansion-O(l)  embedding  of  the  mesh  in  the  Butterfly  achieves  this 
dilation.  These  results,  which  extend  to  Butterfly-like  graphs  such  as  the  Cube- 
Connected  Cycles  and  Benes  networks,  supply  the  first  examples  of  graphs  that  can 
be  embedded  more  efficiently  in  the  Hypercube  than  in  the  Butterfly. 


1.  INTRODUCTION 

This  paper  reports  on  a  continuing  program  of  the  authors,  dedicated  to  determining 
the  relative  computational  capabilities  of  the  various  interconnection  networks  that 
have  been  proposed  for  use  as  multicomputer  interconnection  networks  (BCLR,  BI, 
GHR,  Le|.  We  focus  here  on  one  member  of  the  family  of  butterfly-like  machines, 
that  have  become  one  of  the  benchmark  architectures  for  multicomputers.  The 
major  contributions  of  this  paper  are  the  following  general  results  about  embeddings 
of  graphs  in  Butterfly  networks1: 

1.  We  embed  the  complete  binary  tree  in  the  Butterfly  network,  with  simulta¬ 
neous  dilation  0(1)  and  expansion  0(1). 

2.  We  embed  any  n-vertex  graph  having  a  \/2-bifurcator  of  size  S  —  fl(logn) 
in  the  Butterfly  network,  with  simultaneous  dilation  0(log  S )  and  expansion 
0(1). 

3.  We  prove  that  any  embedding  of  any  planar  graph  G  in  a  Butterfly  network 
must  have  dilation 

«(*$?). 

where:  E(G')  is  the  size  of  the  smallest  1/3-2/3  vertex-separator  of  G\  $((7) 
is  the  size  of  G”s  largest  interior  face. 

The  latter  two  results  lead  to  embeddings  of  graphs  such  as  X-trees  and  meshes  in 
the  Butterfly,  that  are  optimal,  to  within  constant  factors.  By  Result  2,  such  embed¬ 
dings  can  be  found  with  expansion  0(1)  and  with,  respectively,  dilation  O(loglogn) 
and  O(logn);  by  Result  3,  no  embeddings  can  improve  on  these  dilations,  indepen¬ 
dent  of  expansion.  These  embeddings  expose  X-trees  and  meshes  as  the  first  known 
graphs  that  can  be  embedded  very  efficiently  in  the  Hypercube  (simultaneous  dilation 
0(1)  and  expansion  0(1))  but  have  no  efficient  embedding  in  butterfly-like  graphs. 
Note  that,  if  we  restrict  attention  only  to  the  issue  of  dilation,  then  -  to  within  con¬ 
stant  factors  ~  these  graphs  cannot  be  embedded  any  more  efficiently  in  Butterfly 
graphs  than  they  can  in  complete  binary  trees! 


1.1.  The  Formal  Setting 

The  technical  vehicle  for  our  investigations  is  the  following  notion  of  graph  embed¬ 
ding  (Roj.  Let  G  and  H  be  simple  undirected  graphs.  An  embedding  of  G  in  H  is  a 

1  AH  technical  terms  are  defined  in  Section  1.1. 
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one-to-one  association  of  the  vertices  of  G  with  vertices  of  H ,  plus  a  routing  of  each 
edge  of  <7  within  H ,  i.e.,  an  assignment  of  a  path  in  H  connecting  the  images  of 
the  endpoints  of  each  edge  of  G.  The  dilation  of  the  embedding  is  the  length  of  the 
longest  path  in  H  that  routes  an  edge  of  G;  it  thus  measures  how  much  the  edges  of 
G  are  “stretched”  by  the  embedding.  The  expansion  of  the  embedding  is  the  ratio 
\H\/\G\  of  the  number  of  vertices  in  H to  the  number  of  vertices  in  G.  We  use  the 
dilation-  and  expansion-costs  of  the  best  embedding  of  G  in  H  as  our  measures  of 
how  well  H  can  simulate  G  as  an  interconnection  network:  One  views  the  graph  //  as 
abstracting  the  processor-intercommunication  structure  of  a  physical  architecture; 
one  views  the  graph  G  as  abstracting  either  the  task-interdependency  structure  of 
an  algorithm  one  wants  to  implement  on  H  or  the  processor-intercommunication 
structure  of  am  architecture  one  wants  to  simulate  on  H . 

Remark.  A  third  important  measure  of  how  well  H  can  simulate  G  is  congestion , 
the  maximum  number  of  edges  that  are  routed  through  a  single  edge  (or  vertex)  of 

H .  Congestion  does  not  play  a  major  role  in  this  paper,  however,  since 

I.  our  embedding  of  a  complete  binary  tree  in  a  Butterfly  trivially  has  unit  conges¬ 
tion; 

2.  the  n-vertex  Butterfly  is  known  to  be  able  to  simulate  any  n-vertex  bounded- 
degree  graph  with  O(logn)  delay,  irrespective  of  the  fact  that  the  dilation  and 
congestion  of  the  corresponding  embedding  may  both  be  Q(logn);  3.  our  major 
focus  is  on  developing  broadly  applicable  techniques  for  bounding  the  dilation  of 
embeddings. 

Hence,  for  our  purposes,  dilation  is  the  central  measure  of  concern. 

Our  results  hold  for  a  large  variety  of  “levelled”  Hypercube-derivative  host 
graphs  (which  play  the  role  of  our  H' s),  that  we  collectively  term  butterfly  net¬ 
works.  For  the  sake  of  rigor,  we  focus  on  one  particular  such  network  (which  can 
be  viewed  as  the  FFT  network,  with  input  and  output  vertices  identified),  although 
we  could  just  as  easily  substitute  other  such  graphs  -  the  Cube-Connected  Cycles 
[PV]  or  Benes  network  (Be),  for  example.  Formally, 


•  Let  m  be  a  positive  integer.  The  m-level  Butterfly  graph  B(m)  has  vertex-set2 


Fm  =  {0, 1,  •  •  • ,  m  -  1}  x  {0, 1}”*. 


The  subset  Vm  t  =  {£}  x  {0,  l}m  of  Vm  (0  <  £  <  m)  is  the  £th  level  of  B(m). 
The  string  i  €  {0,  l}m  of  vertex  (£,  i)  is  the  position-within-levcl  string  ( PWL 
string ,  for  short)  of  the  vertex.  The  edges  of  B(m)  form  butterflies  (or,  copies 

" (0, 1}"'  denotes  the  set  of  length-m  binary  strings. 
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000 
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\0\  Oil 
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Figure  1:  The  3-level  Butterfly  graph  B(3) 

of  A'„)  between  consecutive  levels  of  vertices,  with  wraparound  in  the  sense 
that  level  0  is  identified  with  level  m.  Each  butterfly  connects  vertices 

{(■,  0o0i  •  •  •  0t~iQ0t+i  •  •  •  0m-l) 


and 


{t,  0001  ’  4  ■  0t-ll0t+l  '  ’  '  0m-l) 


on  level  t  of  B(m )  (0  <  l  <  m;  each  /?,  6  {0, 1})  with  vertices 
(t  +  l(mod  m),  0o0i  ■  •  •  0l-lO0e+1  •  •  •  0m.x) 


and 

{t+  l(mod  m),  0o0i  ■  •  •  0t-M+i  -  •  •  0m-i) 

on  level  i  +  l(mod  m)  of  B(m).  One  can  represent  B{m)  level  by  level,  in 
such  a  way  that  at  each  level  the  PWL  strings  are  the  reversals  of  the  binary 
representations  of  the  integers  0, 1, . . .  ,2m  —  1,  in  that  order.  See  Fig.  1. 

The  guest  graphs  in  our  study,  which  play  the  role  of  our  G' s,  are  complete 
binary  trees,  X-trees,  and  meshes;  see  Fig.  2.  Formally, 
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•  The  htight-h  complete  binary  tree  T(h)  is  the  graph  whose  (2*+1  -  l)-element 
vertex-set  comprises  all  binary  strings  of  length  at  most  h ,  and  whose  edges 
connect  each  vertex  x  of  length  less  than  h  with  vertices  xO  and  xl.  The 
(unique)  string  of  length  0  is  the  root  of  the  tree,  which  is  the  sole  occupant  of 
level  0  of  the  tree;  the  2l  strings  of  length  t  are  the  level-l  vertices  of  the  tree; 
the  strings  of  length  h  (i.e.,  the  level- A  vertices)  are  the  leaves  of  the  tree. 

•  The  height-h  X-tree  X(h)  is  the  graph  that  is  obtained  from  the  height-A 
complete  binary  tree  T(h)  by  adding  cross  edges  connecting  the  vertices  at 
each  level  of  T(h)  in  a  path,  with  the  vertices  in  lexicographic  order.  X-trees 
inherit  a  level  structure  from  their  underlying  complete  binary  trees. 

•  The  s  *  s  mesh  M(s)  is  the  graph  whose  a2-element  vertex  set  comprises  the 
ordered  pairs  of  integers 

{1, 2, • • • , 5}  x  {1,2,  •  •  • ,«}, 

and  whose  edges  connect  vertices  (a,  6)  and  (c,  d)  just  when  |a-c|-f  |6  d\  --  1. 

All  of  these  networks  have  been  seriously  proposed  as  interconnection  networks 
for  multicomputers  [DP,  Ga,  HZ],  hence  are  important  candidates  for  our  study. 
Another  approach  to  comparing  these  networks,  via  implementation  and  analysis 
of  specific  algorithms,  appears  in  [Ag|. 

Our  results  depend  on  three  structural  features  of  a  graph  G : 

1.  Let  S  and  k  be  positive  integers.  The  n-vertex  graph  G  has  a  k-color  \/2- 
bifurcator  of  size  S  if  either  n  <  2  or  the  following  holds  for  every  way  of 
labelling  each  vertex  of  G  with  one  of  k  possible  labels:  By  removing  .S' 
vertices  from  G,  one  can  partition  G  into  subgraphs  Gj  and  G2  such  that3 

W  |  1(7,1  -  |G,|  |  <  1. 

(6)  For  each  label  /,  the  number  of  /-labelled  vertices  in  Gj  is  within  1  of  the 
number  of  /-labelled  vertices  in  Gj. 

(c)  Each  of  Gi  and  Gj  has  a  fc-color  \/2-bifurcator  of  size  S/y/ 2. 

2.  A  1/S-2/S  (vertex-) separator  of  G  is  a  set  of  vertices  whose  removal  partitions 
G  into  subgraphs,  each  having  >  |G|/3  vertices;  we  denote  by  E(G)  the  size 
of  the  smallest  1/3-2/3  vertex-separator  of  G. 

3.  When  G  is  planar  and  we  are  given  a  witnessing  planar  embedding  t,  we 
denote  by  <l>,  (G)  the  number  of  vertices  in  G’s  largest  interior  face  in  the 
embedding.  When  (  is  clear  from  context,  we  omit  the  subscript. 

'W<>  denote  l>y  |(.'|  ill**  iitimhrr  <>f  vertic*1#  in  (lie  cr;vpli 
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1.2.  The  Main  Results 


We  prove  three  results  about  optimal  embeddings  in  the  Butterfly  that  lead  to  a 
variety  of  nontrivial  optimal  embeddings. 

Theorem  1  The  complete  binary  tree  T[h)  can  be  embedded  in  a  Butterfly  graph, 
with  simultaneous  dilation  0(1)  and  expansion  0(1). 

Obviously,  the  embedding  of  Theorem  1  is  within  a  constant  factor  of  optimal  in 
both  dilation  and  expansion.  Building  on  the  embedding,  we  obtain  the  following 
general  upper  bound  result. 

Theorem  2  Any  n-vertex  graph  G  having  a  \/2-bifurcator  of  size  S  =  D(logn)  can 
be  embedded  in  a  Butterfly  graph  with  simultaneous  dilation  O(logS)  and  expansion 
0(1). 

We  balance  Theorem  2  with  one  of  the  first  broadly  applicable  results  for  bound¬ 
ing  dilation  from  below. 

Theorem  3  Any  embedding  of  a  nontree  planar  graph  G  in  a  Butterfly  graph  has 
dilation  f 1  ( K>|jc^ )  •  This  bound  cannot  be  improved  in  general. 

Direct  application  of  the  proofs  of  these  results  yields  the  following  optimal 
embeddings. 

Corollary  1  The  height-h  X-tree  X(h )  can  be  embedded  in  a  Butterfly  graph  with 
simultaneous  dilation  0(log/i)  =  0(loglog  |A'(h)|)  and  expansion  0(1).  Any  embed¬ 
ding  of  X{h)  in  a  Butterfly  graph  must  have  dilation  fl(log h)  =  f!(log  log  |X(/i)|). 

Corollary  2  Any  embedding  of  the  sx  s  mesh  M(s)  in  a  Butterfly  graph  must  have 
dilation  H(logs)  =  fl(log  ]M(s)|). 

Corollary  2  betokens  a  mismatch  in  the  structures  of  meshes  and  Butterfly 
graphs,  since  any  expansion-O(l)  embedding  of  any  graph  G  in  B(m)  has  dilation 
0(Iog|G|).< 

‘‘This  follow?  from  the  fact*  that  fl(m)  has  m2”’  vertices  and  diameter  0{m). 
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Theorem  1  and  Corollaries  1  and  2  can  be  interpreted  as  yielding  tight  bounds 
on  the  efficiency  with  which  a  Butterfly  machine  can  simulate  a  complete-binary- 
tree  machine,  an  X-tree  machine,  and  a  mesh-structured  machine,  with  regard  to 
both  delay  (dilation)  and  resource  utilization  (expansion).  Equating  dilation  with 
delay  is  most  appropriate  when  the  machines  are  to  be  run  in  SIMD  mode. 

The  next  three  sections  are  devoted  to  proving  our  main  results. 


2.  COMPLETE  BINARY  TREES 

2.1.  Embedding  Many  Small  Trees  in  a  Butterfly 

It  is  obvious  from  inspection  that  one  can  find  an  instance  of  the  height-(m  -  1) 
complete  binary  tree  T(m  -  1)  rooted  at  every  vertex  of  B(m).  Somewhat  less 
obvious  is  the  fact  that  one  can  find  m  mutually  disjoint  instances  of  T(m  -  1)  as 
subgraphs  of  B(m).  We  now  verify  this  fact  via  an  embedding  which  will  prove 
useful  as  we  develop  our  final  embedding. 

Proposition  1  For  every  integer  m,  one  can  find  m  mutually  disjoint  instances  of 
T(m  -  1)  as  subgraphs  of  B(m). 


Proof.  To  simplify  exposition,  we  represent  sets  of  binary  strings  by  strings  over 
the  alphabet  {(),  !,•>}.  using  *  as  a  wild-card  character.  The  length-*-  string 

0  —  0O0\  •  '  •  0k-  1. 

where  each  0t  e  {0, 1,  +},  represents  the  set  o{0)  of  all  length-*:  binary  strings  that 
have  a  0  in  each  position  »  of  0  where  /3,  =  0,  a  1  in  each  position  i  of  0  where 
0i  =  1,  and  either  a  0  or  a  1  in  each  position  i  of  0  where  0i  =  *.  For  illustration, 
<t(010)  =  {010},  and  cr(0  *  1)  =  {001,011}.  Call  the  string  0  the  code  for  the  set 
O{0). 

On  to  our  embeddings  of  m  instances  of  T(m  -  1)  in  B{m):  For  any  letter  a 
and  nonnegative  integer  k,  we  denote  by  ak  a  string  of  k  a's. 


For  the  first  instance  of  T(m  —  1),  we  have  the  following  correspondence  between 
tree  vertices  and  Butterfly  vertices. 


Tim 

zl 1 

/evel 

0 

(0,  0"*) 

level 

1 

(1,  iO"1-1 

level 

2 

1 

£ 

o 

M 

* 

ci 

level 

m  -  1 

(m  -  1,  *m- 

For  each  subsequent  instance  of  T(m  —  1),  say  the  jth  where  1  <  j  <  m,  we  have 
the  following  correspondence  between  tree  vertices  and  Butterfly  vertices. 

T(m  -  1)  B(m) 

level  0  0  -  1,  0,-,10m-,"ll) 

level  1  (j,  ty-H  *0m->~2l) 

level  2  O'  +  l(tnod  m),  O'*1!  *2  0m->-3l) 


level  m  -  1  (j  -  2,  'J 

The  placement  of  the  l's  in  the  PWL  strings  ensures  that  the  m  instances  of 
7’(m  1)  are  mutually  disjoint.  To  verify  this,  via  contradiction,  let  us  look  at  an 

arbitrary  level  t  of  B(m)  and  at  arbitrary  distinct  tree  vertices  t  and  j  that  collide 
at  some  position  within  level  t  of  B(m).  It  is  clear  that  all  Butterfly  vertices  that 
are  images  of  the  same  instance  of  T(m  -  1)  are  distinct,  so  we  may  assume  that 
vertices  i  and  j  come  from  distinct  instances  of  T(m  -  1),  call  them  t(i)  and  t(j), 
where  the  t-“name”  of  an  instance  of  T(m  -  1)  is  the  level  of  B(m)  where  its  root 
resides.  We  consider  four  cases  that  exhaust  the  possibilities.  In  each  case,  we 
adduce  a  property  of  the  PWL  strings  that  precludes  any  overlap  in  the  images  of 
the  trees. 

t(»)  =  0: 


I 


! 


If  t(0  =  0,  then  the  PWL  string  of  i  ends  with  0m~e,  while  the  PWL 
string  of  j  has  a  1  in  this  range,  specifically,  in  position  m  —  I  if  j  <  £, 
and  in  position  j  if  j  >  t. 

1  <  t(»)  <  0»  1  L- 
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Every  PWL  string  of  i(t)  starts  with  O'  1,  while  every  PWL  string  of  i(j) 
starts  with  (Pi. 

1  <  i(»)  <  t  <  t(j): 

Every  PWL  string  of  t(i)  has  a  0  in  position  j,  while  every  PWL  string 
of  i(j)  has  a  1  in  that  position. 

(  <  i(i)  <  t[j)  <  m: 

Every  PWL  string  of  i(t)  has  a  1  in  position  t,  while  every  PWL  string 
of  /.(j)  has  a  0  in  position  i. 

The  proof  is  complete.  □ 

An  algebraic  proof  of  Proposition  1,  which  is  “cleaner”  than  our  combinatorial 
proof  here,  appears  in  (ABR);  however,  it  is  the  embedding  rather  than  the  result, 
that  will  be  helpful  in  our  proof  of  Theorem  1. 

The  embedding  in  our  proof  of  Proposition  1  does  not  serve  us  directly  in  our 
attempt  to  embed  a  large  complete  binary  tree  in  a  small  Butterfly,  since  (for  one 
thing)  it  places  the  roots  of  every  instance  of  T(m  -  1)  at  a  different  level  of  2?(m); 
and  it  is  not  clear  how  to  combine  these  instances  into  a  bigger  complete  binary 
tree  with  small  dilation.  However,  the  overall  strategy  of  the  embedding  will  be 
useful  in  Section  2.2. D. 


2.2.  Optimally  Embedding  Trees  in  Butterfly  Graphs 

We  turn  now  to  the  proof  of  Theorem  1.  Specifically,  we  prove  the  following. 

For  any  integer  m,  one  can  embed  the  complete  binary  tree  T(m  + 
[logmj  -  l)  in  the  Butterfly  graph  B(m  +  3),  with  dilation  0(1). 

To  simplify  our  description,  let  q  =d,f  m  4-  [logmj  -  1,  and  assume  henceforth 
that  m  is  even;  clerical  changes  will  remove  the  assumption. 

A.  The  Embedding  Strategy 

We  wish  to  embed  the  tree  T[q )  with  dilation  0(1),  in  the  smallest  Butterfly 
that  is  big  enough  to  hold  the  tree,  namely.  B(m).  We  fall  somewhat  short  of  this 
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goal,  but  not  by  much:  We  find  an  embedding  with  dilation  0(1),  but  we  have 
to  use  a  somewhat  larger  host  Butterfly  graph  (specifically,  B(m  +  3))  in  order  to 
resolve  collisions  in  our  embedding  procedure.  Our  embedding  proceeds  in  four 
stages.  Stage  1  embeds  the  top  logm  levels  of  T(q)  with  unit  dilation  in  B(m), 
thereby  specifying  implicitly  the  images  in  B(m)  of  the  roots  of  the  m/2  subtrees  of 
T(q)  rooted  at  level  logm  -  1.  Stage  2  expands  these  subtrees  a  further  m/2  levels, 
but  now  in  B(m  +  1),  with  dilation  2,  thereby  specifying  implicitly  the  images  in 
B(m)  of  the  roots  of  the  m  •  2m|/:_1  subtrees  of  T(q)  rooted  at  level  m/2  +  log  m  -  1 
of  the  tree.  In  Stage  3,  we  embed  the  final  m/2  levels  of  T(q)  in  B(m  +1),  with 
dilation  4.  The  vertex-mappings  in  each  stage  are  embeddings  (i.e.,  are  one-to-one); 
there  is,  however,  “overlap"  (i.e.,  distinct  vertices  of  r(q)  getting  mapped  to  the 
same  vertex  of  B(m  +1))  among  the  mappings  of  the  three  stages.  In  Stage  4,  we 
eliminate  this  overlap  by  expanding  the  host  Butterfly  by  two  more  levels,  thereby 
giving  us  four  connected  isomorphic  copies  of  B(m  +  1).  At  the  cost  of  increasing 
dilation  by  2,  we  modify  our  mapping  so  that  each  of  Stages  1,  2,  3  is  performed  in 
a  distinct  copy  of  B(m  +  1),  thereby  eliminating  all  overlap. 

B.  Stage  1:  The  Top  logm  Levels  of  T(q) 

We  place  the  root  of  T[m  +  logm)  at  position 

(m  -  logm,  Om) 

of  B(m).  We  then  proceed  to  higher-numbered  levels,  embedding  the  top  logm 
levels  of  T(q)  as  a  subgraph  of  B(m),  ending  up  with  the  leaves  of  these  levels  in 
positions 

of  B(m)  (because  of  wraparound).  See  Fig.  3.  We  call  the  rightmost  logm  -  1  bits 
of  each  of  the  resulting  PWL  strings  the  signature  of  the  Butterfly  position  and  of 
the  subtree  rooted  at  that  position.  It  is  convenient  to  interpret  a  signature  as  an 
integer  in  the  range  {0, 1,  •  •  •  ,m/2  -  1},  as  well  as  a  bit  string. 

The  embedding  in  Stage  1  is  trivially  one-to-one,  with  unit  dilation. 

C.  Stage  2:  The  Next  m/2  Levels  of  T{q) 

Call  the  (m/2  +  l)-!evel  subtree  of  T[q)  that  has  signature  k,  the  kth  subtree. 
Our  goal  is  to  embed  the  kth  subtree  in  B(m  +  1)  (with  dilation  2),  so  that  its  2m/2 
leaves  form  the  set  of  positions5 

(m  -  1,  *0  *  0  •  •  •  *  0  *  1  *  0  •  •  •  *  0  *  0?), 

°Tlie  last  bit.  position  is  not  affected  by  this  Stage,  so  is  denoted  “?”. 
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where  the  1  appears  in  the  kth  even  position  from  the  right  (using  O-based  counting); 
call  this  the  signatory  I  of  the  tree  position.  For  instance,  when  m  -  K,  the  second 
Hubtr<*e  has  leaves  in  positions 


(7,  *0*1*0  *  0?) 

of  B{ 9).  Wc  embed  these  (m/2  +  l)-level  trees  by  alternating  binary  and  unary 
branchings  in  B(m  +  1),  starting  at  the  “roots”  placed  at  level-0  vertices  of  B[m-\  1) 
during  Stage  1;  we  place  a  tree-vertex  after  each  unary  branching.  See  Fig.  I. 
Binary  branchings  generate  the  *’s  in  the  code  for  the  set  of  PWL  strings,  while 
unary  branchings  generate  the  0's  and  l’s  in  the  code.  As  a  simple  example:  a 
binary  branching  from  vertex 


(0,  000000011;, 

which  holds  the  root  of  one  of  the  subtrees  planted  during  Stage  1,  generates  vertices 

'1,  *00000011); 

a  unary  branching  thence  generates  vertices 

(2.  i 00000011), 

where  we  place  the  level- 1  vertices  of  the  subtree;  a  second  binary  branching  gen¬ 
erates  vertices 

(3,  1 0.00001 1;; 

a  unary  branching  thence  generates  vertices 

(4,  *0  *  100011), 

where  we  place  the  level-2  vertices  of  the  subtree;  a  subsequent  sequence  of  alter¬ 
nating  binary  and  unary  branchings  finally  embeds  the  desired  set  of  leaf  positions 
in  the  advertised  vertices  of  £(m  +  1). 

This  stage  of  our  embedding  clearly  has  dilation  2.  The  fact  that  that  this  stage 
is  one-to-one  (though  it  may  produce  conflicts  with  the  embedding  from  Stage  1) 
has  two  origins.  First,  we  are  using  levels  0  through  m  of  B(m  +  1)  for  the  m  +  1 
levels  of  this  stage,  so  the  leaves  of  the  embedded  trees  do  not  wrap  around  to 
conflict  with  their  roots.  Second,  each  signatory  1,  whose  placement  identifies  its 
respective  tree,  is  set  “on”  before  the  signature  bits  are  reached  and  altered  by  the 
sequence  of  branchings.  This  is  ensured  by  the  fact  that  we  place  the  signatory  1 
by  counting  from  the  right:  the  signature  bits  occupy  the  rightmost  log  m  -  1  bits 
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Figure  1:  A  logical  view  of  the  next  m/2  levels  of  the  embedding 


•  • 


of  the  I’WL  string;  by  the  time  the  branchings  have  reached  the  t11'  bit  from  the 
right,  only  the  rightmost  (logt)  bits  of  the  signature  are  needed  to  specify  the  next 
position  w-here  branching  occurs.  Hence,  at  the  point  w'hen  we  place  the  signatory 
I  in  the  i"’  position,  the  odd-numbered  positions  to  the  left  of  the  1  are  all  0,  and 
th«'  positions  to  the  right  of  the  1  form  the  binary  representation  of  i,  possibly  with 
leading  O  s. 

D.  Stage  3:  The  Final  m/2  Levels  of  T(q) 

Our  goal  in  Stage  3  is  to  use  the  m  •  2m,/2_I  leaves  of  the  m/2  trees  generated 
in  Stage  2  as  the  roots  of  the  (m/2  +  1)-Jevel  subtrees  comprising  the  bottom  m/2 
levels  of  T(q).  Each  root  has  a  signatory  1,  identifying  the  subtree  it  came  from 
in  Stage  2,  and  a  serial  number  obtained  from  the  odd-numbered  bits  of  its  PWL 
string.  The  signatory  l’s  will  keep  trees  sired  by  different  Stage-2  trees  disjoint;  the 
serial  numbers  will  guard  against  collisions  among  trees  that  were  sired  by  the  same 
Stage-2  tree.  The  main  challenge  here  is  to  achieve  the  embedding  while  the  roots 
of  all  the  trees  reside  at  the  same  level  of  B(m+  1)  (which  is  how  Stage  2  has  placed 
them).  To  accomplish  this,  we  have  the  trees  grow  upward ,  in  the  direction  of  lower 
level-numbers,  for  varying  amounts  of  time,  before  starting  to  grow  downward ,  in 
the  direction  of  higher  level-numbers.  While  growing  either  upward  or  downward, 
a  tree  grows  via  alternating  unary  and  binary  branchings,  so  as  to  preserve  Hu 
serial  number-,  this  alternation  will  incur  dilation  2.  An  additional  dilation  of  2  is 
incurred  while  a  tree  grows  upward:  each  tree  begins  to  grows  upward  using  only 
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Figure  5:  A  logical  view  of  the  final  m/2  levels 


every  fourth  level  of  B(m  +  1);  when  it  “turns"  from  growing  upward  to  growing 
downward,  it  uses  the  levels  it  has  skipped  while  moving  upward  to  regain  level  0 
of  B(tn  f  l),  at  which  time  it  grows  downward  using  every  other  level  of  B(m  -I  1). 
See  Fig.  5.  Thus,  in  all,  this  Stage  of  the  embedding  incurs  dilation  4. 

All  trees  with  the  same  signatory  1  (i.e.,  rooted  at  the  leaves  of  the  same  Stage-2 
tree)  will  grow  in  lockstep.  We  refer  to  the  trees  sharing  a  signatory  1  in  the  k"' 
even  bit-position  as  the  A:th  subtrees  of  T(q),  0  <  k  <  m/2.  We  place  the  vertices  of 
the  klh  subtrees  of  T(q)  into  B(m  -I-  1)  as  follows: 

•  For  the  0th  trees,  we  place  the  2*  level-£  vertices  of  T(q)  at  level  2 i  of  B(m  +  1 ). 

(Thus,  these  trees  grow  downward  immediately.) 

•  For  the  ku'  trees,  k  >  0: 

-  we  place  their  unique  level-0  vertex  at  level  0  of  B(m  +  1)  (in  fact  this 
was  placed  during  Stage  2) 
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-  for  1  <  l  <  [k/ 2 J ,  we  place  their  2l  levels  vertices  at  level  m  -  4(  4  1  of 
B(m  4  1) 

-  if  k  is  odd.  we  place  their  2 level-([Ar/2])  vertices  at  level  m -4 \k/2]  -t 
3  of  B(rn  +  1) 

-  for  \kf  2}  4  1  <  l  <  k,  we  place  their  2l  level-/?  vertices  at  level  m  -  4(k  - 
l)  -  1  of  B(m  4  1) 

Now  we  verify  that  the  described  mapping  is  one-to-one.  hence  an  embedding. 
We  consider  separately  the  two  potential  sources  of  collisions. 

First.,  we  note  that  there  can  be  no  collisions  among  the  2m/2  A:th  trees,  for  any 
Ar,  since  each  of  these  trees  has  a  unique  serial  number. 

Second,  we  note  that,  for  each  fixed  serial  number,  there  can  be  no  collision 
between  the  j,h  and  A-'1’  trees  having  that  serial  number.  This  is  argued  most  easily 
by  considering  how  such  trees  are  laid  out  level  by  level.  To  simplify  exposition,  we 
present  only  the  even  bit-positions  of  the  image  vertices  in  B(m+  1),  since  the  odd 
bit-positions  hold  identical  serial  numbers.  Note  first  that  the  top  k  levels  of  each 
Ar"'  tree  are  placed  in  vertices  of  the  form 

in  B[i»  l  I);  hence,  their  membership  in  a  Arlh  tree  is  announced  by  the  leftmost 
m  2  A-  4  1  even  bit-positions  of  the  PWL  strings.  For  tree-levels  >  k,  the  j,u  and 
k"'  trees  are  distinguished  as  follows.  .Say.  with  no  loss  of  generality,  that  j  k. 
For  each  0  •_  f  \  w ,  2  Ac,  the  level-(Ar  +  f)  vertices  of  each  A:"1  tree  are  placed  at 
vert  ices 

</. 

of  B(m  4  1).  By  the  same  token,  for  each  0  i  £  1  m/2  -  j,  the  level-(j  +  l)  vertices 
of  each  jih  tree  are  placed  at  vertices 


of  B(rn  4  I).  Since  j  k  by  hypothesis,  we  see  that,  at  those  levels  of  B(m  -t  I) 
where  we  place  vertices  of  both  trees,  the  A;th  even  bit-position  from  the  right  of  each 
Ar1  *•  tree  contains  a  1,  while  the  corresponding  bit-position  of  each  jlh  tree  contains 
a  0. 

Thus,  the  mapping  in  this  stage  is  an  embedding. 

E.  Resolving  Collisions 
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Figure  6:  Replicating  B(m  +  1)  to  avoid  collisions 


We  now  have  three  subembeddings  that  accomplish  the  desired  task,  except,  for 
tlu*  fact  that  Stage  i  and  Stage  j  may  map  different  tree  vertices  to  the  same  But¬ 
terfly  vertex.  We  resolve  these  possible  collisions  as  follows.  Instead  of  performing 
the  subembeddings  in  B(m  +  1).  we  perform  them  in  B(m  f  3),  placing  each  sube- 
mhedding  in  a  distinct  copy  of  fl(m  f  1).  We  make  the  transition  between  copies 
of  li[m  I  1)  as  follows.  As  the  Stage-1  embedding  of  the  top  of  T(q)  reaches  level 
hi  I  of  its  copy  of  B(m  -t  1),  we  use  a  sequence  of  unary  branchings  in  B(m  t  3)  to 
reach  level  0  of  the  next  copy  of  B(m  +1).  We  perform  the  Stage-2  subentbedding 
within  this  second  copy;  this  takes  us  to  level  m  -  1  of  that  copy,  where  a  sequence 
of  unary  branchings  in  B(m  +  3)  takes  us  to  level  0  of  the  third  copy  of  B(m  -f  1). 
We  perform  the  Stage-3  subembedding  in  this  third  copy.  See  Fig.  6.  The  transi¬ 
tion  from  level  in  -  1  of  the  second  copy  of  B(m  4-  1)  to  level  0  of  the  third  copy 
engenders  dilation  4. 

The  embedding,  hence  the  proof,  is  now  complete.  □ 

2.3.  The  Issue  of  Optimality 

Theorem  1  settles  for  an  embedding  of  complete  binary  trees  in  Butterfly  graphs, 
that  achieves  dilation  0(1)  and  expansion  0(1)  simultaneously.  While  this  achieves 
our  overall  goal  of  optimality  to  within  constant  factors,  it  does  leave  open  the 
possibility  of  those  constant-factor  improvements.  We  have  been  unable  to  deter¬ 
mine  exact  dilation-expansion  tradeoffs  for  embeddings  of  complete  binary  trees  in 
Butterfly  graphs,  but  we  can  show  easily  that  it  is  impossible  to  optimize  both  cost 
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measures  simultaneously.  Thus,  one  cannot  hope  for  the  level  of  “perfection”  found 
in,  say,  [GHR]1'. 

Proposition  2  No  embedding  of  T(q)  in  B(m  +  1)  has  unit  dilation. 

Froof.  Both  complete  binary  trees  and  Butterfly  graphs  are  bipartite  graphs:  one 
can  color  the  vertices  of  either  graph  red  and  blue  in  such  a  way  that  every  edge 
connects  a  red  vertex  and  a  blue  one.  For  any  Butterfly  graph  B(r),  on  the  one 
hand,  the  numbers  of  red  and  blue  vertices  are  within  r  of  being  equal;  for  any 
complete  binary  tree,  on  the  other  hand,  one  of  the  sets  has  roughly  twice  as  many 
vertices  as  the  other.  Thus,  one  cannot  find  a  unit-dilation  embedding  of  a  complete 
binary  tree  in  the  smallest  Butterfly  graph  that  has  enough  vertices  to  hold  it.  n 


3.  UPPER  BOUNDS  -  THEOREM  2 

This  section  is  devoted  to  proving  Theorem  2.  Since  all  of  the  relevant  ideas  in  the 
proof  are  present  in  its  application  to  specific  families  of  graphs,  we  actually  prove 
only  the  upper  hound  of  Corollary  1.  The  reader  should  be  able  to  generalize  easily 
to  arbitrary  families  of  graphs,  thereby  proving  Theorem  2.  For  the  remainder  or 
the  Section,  we  Iherelore  focus  on  the  problem  of  embedding  X-trees  in  Butterflies. 

Our  embedding  of  the  X-tree  in  the  Butterfly  graph  is  indirect:  First  we  find  a 
unit-expansion,  dilation-G>(log  log  n)  embedding  of  X(h)  in  T(h).  Then  we  compose 
this  embedding  with  the  expansion-O(l),  dilation-O(l)  embedding  of  T(h)  in  B(m) 
from  Theorem  1 .  to  obtain  the  upper  bound  of  Theorem  2.  We  discuss  here  only  the 
former  embedding,  which,  in  fact,  embeds  the  X-tree  A(m)  in  the  complete  binary 
tree  T(m).  For  notational  simplicity,  let  n  — def  2m+l  -  1,  the  number  of  vertices  in 
A(m).  We  devote  this  section  to  proving  the  following. 

Proposition  3  For  any  integer  m,  one  can  embed  the  X-tree  X(m)  in  the  complete 
binary  tree  T(m).  with  dilation  O(logm)  =  O(loglogn). 

Using  the  obvious  fact  that  the  n-vertex  X-tree  can  be  bisected  (in  the  sense  of 
statement  1  above)  by  removing  O(logn)  edges,  coupled  with  lechr  iques  in  Section 
I  of  |B|j|.  the  reader  can  easily  prove  the  following. 

‘  In  |CIIHI  a  variant  of  D{tn)  with  tn>  wraparound  is  embedded  ill  the  Hypercnbe  with  unit 
dilation  and  opt  imal  expansion. 


I 


1 


J 


I 


I 


17 


Lemma  1  For  all  positive  integers  n,k,  the  n-vertex  X-tree  has  a  k-color  \f2- 
bifurcator  of  size  S  =  2k  ■  log  n. 

Proof  of  Proposition  S.  Our  embedding  uses  the  following  auxiliary  structure,  which 
appears  (in  slightly  different  form)  in  (BCLR).  A  bucket  tree  is  a  complete  binary 
tree,  each  of  whose  level-l  vertices  has  (bucket)  capacity 

‘  •><*(§) 

for  some  fixed  constant  c  to  be  chosen  later  (in  Lemma  2).  We  embed  A'(w) 
in  T(m)  in  two  stages:  First,  we  embed  X{m)  in  a  bucket  tree,  via  a  many-to-one 
function  p  that  “respects”  bucket  capacities  (always  placing  precisely  c  ■  log((2m ' 1  - 
l)/2/)  vertices  of  A'(m)  in  each  level-£  vertex  of  the  bucket  tree)  and  has  constant 
“dilation”.  Then  we  “spread"  the  contents  of  the  bucket  tree’s  buckets  within  T(m), 
to  achieve  an  embedding  of  A'(m)  in  T(m),  with  the  claimed  dilation.  Formally, 
the  first  stage  of  the  embedding  is  described  as  follows. 

Lemma  2  Every  X-tree  A'(m)  can  be  mapped  onto  a  bucket  tree  in  such  a  way  that: 
(a)  exactly 

( 2m+l  -  1\ 

N{t)  =  14  log  f - J+24 

vertices  of  A f(m)  are  mapped  to  each  level-l  vertex  of  the  bucket  tree,  and 

(l>)  vertices  that  are  adjacent  in  A‘(m)  are  mapped  to  buckets  that  are  at  most 

distance  5  apart  in  the  bucket  tree. 

The  constants  in  the  expression  for  N{()  can  be  reduced  by  increasing 
the  constant  5  in  part  (b)  of  the  Lemma  (say,  to  10).  We  suffer  the 
larger  constants  in  order  to  simplify  the  technical  development  in  the 
proof.  The  interested  reader  can  easily  mimic  our  development  with 
other  constants. 

Proof.  The  basic  idea  is  to  recursively  bisect  A f(m),  using  a  5-color  \/2-bifurcator 
(the  uses  of  the  colors  will  become  clear  momentarily),  placing  successively  smaller 
sets  of  v^-bifurcator  vertices  in  lower-level  buckets  of  the  bucket  tree.  We  also 
place  other  vertices  in  the  buckets,  in  order  to  ensure  the  desired  “dilation”  and 
in  order  to  ensure  that  all  buckets  are  filled  to  capacity.  The  formal  description 
of  the  mapping  will  require  two  iterations.  First,  we  present  a  mapping  procedure 
that  establishes  the  sufficiency  of  the  quantities  N(i)  as  bucket  capacities.  Then 
we  refine  the  initial  mapping  to  complete  the  proof. 
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We  simplify  our  description  of  this  technically  cumbersome  procedure  in  two 
ways.  First,  we  describe  in  detail  what  the  procedure  would  look  like  if  we  were 
using  S-color  bifurcators  rather  than  5-color  bifurcators;  the  reader  should  be  able 
to  extrapolate  from  oor  description  to  arbitrray  numbers  of  colors.  Second,  we 
establish  the  following  notation. 

•  We  denote  by  Bx,  where  X  denotes  the  null  string  (i.e.,  the  string  of  length  0) 
over  the  alphabet  {1,2},  the  bucket  at  the  root  of  the  bucket  tree. 

•  In  general,  letting  i  denote  any  string  over  the  alphabet  {1,2},  we  denote 
by  Bz i  and  Bx 2  the  buckets  at  the  children  of  the  vertex  of  the  bucket  tree 
having  bucket  Br;  for  example,  Bj  and  B2  denote  the  buckets  at  the  children 
of  the  rool  vertex  of  the  bucket  tree.  B\\  and  B i2  denote  the  buckets  at  the 
left  grandchildren  of  the  root  vertex.  Bn  and  B22  denote  the  buckets  at  the 
right  grandchildren  of  the  root  vertex,  and  so  on. 

Algorithm  Bucket:  Mapping  X(m)  into  a  bucket  tree 


Step  1.  Initial  coloring  and  bisection. 

l.a.  Initialize  every  vertex  of  X(m)  to  color  A. 

l.b.  Associate'  the  graph  A'(m)  with  the  root  of  the  bucket  tree. 

l.c.  Bisect  A  (m).  to  obtain  subgraphs  X\  and  .V2,  and  place  the  v^-bifurcator 
vertices  in  bucket  Bx- 

l.d.  Recolor  every  A-colored  vertex  of  X{m)  that  is  adjacent  to  a  vertex  in 
bucket  Bx  with  color  0. 

1. e.  Associate  X,  (1  €  {1,2})  with  the  child  of  the  root  vertex  of  the  bucket. 

tree  holding  bucket  B,. 

Step  2.  Second-level  bisection. 

2, a.  Use  a  2-color  \/2-bifurcator  for  each  X,,  to  create  subgraphs  X,\  and 

X,2. 

2.b.  Place  the  v^-bifurcator  vertices  for  each  X,  in  the  corresponding  bucket 
B,  of  the  bucket  tree. 

2,c.  Recolor  every  A-coIored  vertex  of  X(m)  that  is  adjacent  to  a  vertex  in 
bucket  B,  with  color  1. 

Tin*  line  are  intended  to  make  it  easier  for  the  reader  to  follow  our  description 

nf  1  Ik-  mapping. 


2. d.  For  each  X,,  associate  each  subgraph  Xt]  with  the  \/2-bifurcator-tree 

vertex  associated  with  bucket  2?,,. 

Step  3.  Third-level  bisection. 

3. a.  Use  a  3-color  \/2-bifurcator  for  each  X,y,  to  create  subgraphs  A'tJi  and 

Xu* 

3.b.  Place  the  \/2-bifurcator  vertices  for  each  X,y  in  the  corresponding  bucket 
Bij  of  the  bucket  tree. 

3.c.  Recolor  every  ^-colored  vertex  of  X(m)  that  is  adjacent  to  a  vertex  in 
bucket  with  color  0. 

3.d.  For  .each  X,y,  associate  each  subgraph  X,,*  with  the  \/2-bifurcator-tree 
vertex  associated  with  bucket  27, y*. 

Step  s.  (4  <  s  <  m)  All  remaining  bisections. 

s.a.  For  each  subgraph  Xv  (y  €  {1,2})  of  X(m)  created  in  Step  s  -  1.  place 
every  vertex  of  color  s  (mod  2)  in  the  associated  bucket  Bv. 

s.b.  Use  a  3-color  %/2-bifurcator  for  each  X„,  to  create  subgraphs  XV|  and 

AV 

s.c.  Place  the  \/2-bifurcator  vertices  for  each  Xv  in  the  corresponding  bucket. 
By  of  the  bucket  tree. 

s.d.  Recolor  every  A-colored  vertex  of  X(m)  that  is  adjacent  to  a  vertex  in 
bucket  By  with  color  length{y)  (mod  2). 

s.e.  For  each  Xy,  associate  each  subgraph  Xy,  with  the  \/2-bifurcator-tree 
vertex  associated  with  bucket  By,. 

We  now  analyze  5-color  analogue  of  the  described  mapping ,  to  show  that  it 
satisfies  the  demands  of  Lemma  2,  with  the  requirement  of  “exactly”  N{1)  vertices 
per  level-2  bucket  replaced  by  “no  more  than”  N{i)  vertices  per  level-2  bucket,  i.e., 
to  show  that  our  bucket  capacities  are  big  enough.  Since  the  “dilation”  condition 
(b)  is  transparently  enforced  when  certain  colored  vertices  are  automatically  placed 
in  buckets  (in  Step  s.a),  it  will  suffice  to  establish  that  the  populations  of  the 
buckets  are  as  indicated  in  the  modified  condition  (a).  This  follows  by  the  following 
recurrence,  wherein  N(k)  denotes  the  number  of  vertices  of  X{m)  that  get  mapped 
into  a  bucket  at  level  k  -  1  of  the  bucket  tree. 

jv(*)  <  f^iV<*:-5)l+6lo*(?) 

<  ^AT(t-5)  +  10log(pj 
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with  initial  conditions 


•  iV  ( 1 )  <  2  log  n 

•  N  (2)  <  4  log  (f) 

.  jV (3)  <  6 log  (a) 

.  N(  4)  <8  log  (f) 

.  yV( 5)  <  10  log  (75) 

The  initial  conditions  reflect  the  sizes  of  the  appropriately  colored  \/2-bifurcatorfi 
of  X(m):  At  each  level  (,  1  <  t  <  4,  one  uses  an  ^-colored  %/2-bifurcator,  followed 
by  a  5-color  \/2-bifurcator  at  all  subsequent  levels.  At  levels  s  >  2,  the  buckets 
contain  not  only  v2-bifurcator  vertices,  which  account  for  the  term 

10,06  (?) 

in  the  general  recurrence;  they  contain  also  the  vertices  of  A'(m)  that  are  placed 
to  satisfy  the  ‘‘dilation”  requirements.  The  latter  vertices  comprise  all  neighbors  of 
t  he  N( k.  5)  occupants  of  the  distance-4  ancestor  bucket  that  have  not  yet  been 
placed  in  any  other  bucket.  Since  vertices  of  A'(m)  can  have  no  more  than  five 
neighbors,  and  since  our  5-color  bisections  allocate  these  neighbors  equally  among 
the  descendants  of  a  given  bucket,  these  “dilation”-generated  vertices  can  be  no 
more  than 

in  number.  These  two  sources,  the  V^-bifurcators  and  their  neighbors,  account  for 
the  occupants  of  the  buckets  and  for  the  recurrence  counting  them.  To  complete 
the  proof  of  the  modified  Lemma,  one  now  shows  by  standard  techniques  that  the 
indicated  recurrence,  with  the  indicated  initial  conditions,  has  the  solution 

N{k)  <  14  log  (^)  +24. 

Finally,  we  turn  to  the  original  form  of  the  Lemma.  This  follows  from  the 
modified  form,  upon  refining  the  Algorithm  by  adding  the  following  substeps  at  the 
indicated  points. 


Figure  7:  Unloading  the  buckets 

At  the  end  of  each  step  of  the  Algorithm,  when  we  have  finished  filling 
a  bucket  Bz  (x  €  {1,2}  )  with  vertices  obtained  from  a  recent  bisection 
or  from  our  desire  to  maintain  small  “dilation”,  we  check  the  population 
of  the  bucket  against  the  ceiling  population  jV(£),  where  t  —  length{x). 

If  the  bucket  contains  fewer  than  N(t),  vertices,  then  we  add  enough 
new  vertices  to  it  from  the  remaining  associated  subgraph  to  fill  it  to 
capacity. 

This  last  step  ensures  that  all  buckets  at  level  t  of  the  bucket  tree  contain  exactly 
N(()  vertices.  □ 

Our  final  task  is  to  refine  the  “dilation”-5  mapping  of  Lemma  2  to  a  bona  fide 
embedding  of  X(m)  in  T(m),  having  dilation  O(loglogn).  We  proceed  inductively, 
emptying  buckets  into  T(m)  in  such  a  way  that  each  tree  vertex  is  assigned  a 
unique  X-tree  vertex.  In  general,  we  denote  by  Tz  the  smallest  subtree  of  T(m) 
that  is  rooted  at  level  length(x)  of  T{m)  and  that  contains  the  contents  of  bucket 
Bz.  (In  general,  the  contents  of  Bz  will  occupy  only  the  last  few  levels  of  Tz.)  See 
Fig.  7. 

•  Place  the  log  n  elements  of  bucket  B\  in  the  topmost  copy  of  T(loglogn)  in 
T’(m),  in  any  way. 

•  Consider  the  subtrees  of  T\  rooted  at  level  1  of  T(m).  Place  the  contents  of 
bucket  By  in  the  (roughly)  log  log  n  levels  of  the  leftmore  of  these  two  subtrees, 
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starting  immediately  after  the  leaves  of  7V  Place  the  contents  of  bucket  f?2 
analogously,  using  the  rightmore  of  these  two  subtrees,  starting  immediately 
after  the  leaves  of  T\.  We  have  thus  implicitly  defined  the  subtrees  Tx  and  T2. 

Note  that  by  this  point,  we  are  using  enough  of  the  top  levels  of  T(m)  that  we  need 
use  only  one  more  level  in  order  to  place  the  contents  of  the  next  level  of  buckets. 
The  importance  of  this  fact  is  that  it  guarantees  that  all  of  the  subtrees  Tx  will 
have  height  O(loglogn).  (Namely,  Tx,  Tx,  and  T2  have  the  desired  height,  and  all 
subsequent  trees  will  result  from  adding  one  level  of  leaves  to  a  tree  whose  root  is 
one  level  lower  in  T(m)  than  was  its  father’s  root.) 

•  Proceeding  inductively,  assume  that  we  have  filled  subtrees  T,  of  T(m)  with 
bucket  contents,  for  strings  x  €  {1,2}  of  length  <  l.  We  now  consider 
the  subtrees  of  T(m)  rooted  at  level  t  +  1;  each  subtree  Tt  rooted  at  level 
i  thus  spawns  two  children.  We  order  these  2t+l  subtrees  from  left  to  right, 
according  to  the  lexicographic  order  on  the  subscript-strings  x.  We  then  place 
the  contents  of  the  bucket  BzX  in  the  leaves  of  the  leftmore  of  the  children  of 
Tx ,  beginning  where  the  contents  of  bucket  Bz  left  off.  Analogously,  we  place 
the  contents  of  the  bucket  i?l2  in  the  leaves  of  the  rightmore  of  the  children 
of  Tz,  beginning  where  the  contents  of  bucket  Bt  left  off. 

The  described  procedure  clearly  produces  an  embedding  of  Ar(m)  in  T(m),  since 
each  vertex  of  A'(m)  is  assigned  to  a  unique  tree  vertex.  Additionally,  the  em¬ 
bedding  has  unit  expansion  since  no  tree  vertices  are  passed  over  in  the  assign¬ 
ment.  process  and  since  all  buckets  at  each  level  i  have  the  same  population  N(f) 
(so  all  subtrees  Tt  are  isomorphic).  Finally,  the  procedure’s  method  of  spreading 
bucket  contents  throughout  T(m)  produces  an  embedding  with  the  desired  dilation, 
namely,  O(loglogn).  Specifically,  by  always  spreading  the  contents  of  buckets  Bt i 
and  Bz2  in  the  leaves  of  the  left  and  right  subtrees  of  the  depth-O(loglogn)  subtree 
that  contains  the  contents  of  bucket  Bz ,  the  procedure  guarantees  that  the  least 
common  ancestor,  in  T(m),  of  the  set  comprising  the  contents  of  any  bucket  plus 
the  vertices  in  buckets  at  most  five  buckets  up  (which  will  lie  in  adjacent  levels 
fc.fc  +  l,fc  +  2, fe  +  3,fc  +  4,fc  +  5  of  the  bucket  tree)  are  always  within  a  subtree 
of  height  O(loglogn)  of  T(m).  Thus,  we  have  produced  the  desired  embedding, 
thereby  proving  Proposition  2,  hence  Theorem  2.  □ 

4.  LOWER  BOUNDS  -  THEOREM  3 

We  demonstrate  the  near-optimality  (to  within  constant  factors)  of  the  embeddings 
of  Section  3  -  in  fact,  true  optimality  for  X-trees  -  by  proving  the  lower  bound  of 
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Theorem  3.  In  contrast  -with  Theorem  2,  Theorem  3  is  most  easily  proved  in  its  full 
generality. 

Assume  henceforth  that  we  are  given  a  planar  graph  G ,  a  planar  embedding  t 
of  G ,  and  a  minimum-dilation  embedding  p  of  G  in  B(p);  let  p  have  dilation  6. 

We  begin  by  noting  that  we  can  simplify  our  quest  somewhat.  Specifically,  since 
we  aim  only  for  bounds  that  hold  up  to  constant  factors,  we  lose  no  generality  by 
assuming  henceforth  that  (in  the  embedding  e)  the  exterior  face  of  G  is  a  simple 
cycle: 

Lemma  3  One  can  add  edges  to  the  graph  G  within  the  embedding  (  in  such  a  way 
that 

•  the  resulting  embedding  e'  is  a  planar  embedding  of  the  resulting  graph  G" 

•  in  the  embedding  the  exterior  face  of  G1  is  a  simple  cycle 

•  E(C')  =  0(E(G)) 

•  =  max(3,$,(G)). 

Proof  Sketch.  If  the  exterior  face  of  G  is  not  a  simple  cycle,  it  is  because  of  cut- 
edges  and/or  pinch-vertices.  We  take  each  cut-edge  in  turn  and  create  a  triangle 
containing  it  as  an  edge;  then  we  repeat  the  process  with  any  remaining  cut-edge. 
When  no  more  cut-edges  exist,  we  eliminate  each  pinch-vertex  in  turn  by  creating 
a  triangle  that  includes  the  pinch-vertex  as  a  vertex.  Since  each  added  edge  creates 
a  triangle  and  spans  only  two  edges  of  G,  the  claims  about  $(G‘)  and  E(G')  are 
immediate.  □ 

A  consequence  of  Lemma  3  is  that  we  may  henceforth  assume  that  every  edge 
of  G  resides  in  some  interior  face  (in  the  embedding  e). 

We  turn  now  to  the  quantitative  consequences  of  Lemma  3. 

A  set  of  faces  of  G  is  connected  in  the  embedding « just  when  their  corresponding 
vertices  are  connected  in  the  graph  r(G;«)  whose  vertices  are  the  faces  of  G  and 
whose  edges  connect  a  pair  of  face-vertices  just  when  the  faces  share  a  vertex.  A 
set  5  of  vertices  of  G  is  face-connected  (in  e)  if  the  set  of  interior  faces  of  G  that 
contain  one  or  more  of  the  vertices  of  S  is  connected. 

Let  A  be  a  connected  component  of  the  graph  G  remaining  after  removing  a 
set  5  of  vertices  from  G.  The  S -boundary  of  A  is  the  set  of  vertices  of  A  that  are 
adjacent  (in  G)  to  vertices  of  S. 
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Lemma  4  If  one  removes  a  face- connected  set  of  vertices  S  from  the  graph  G, 
then  the  S -boundary  of  every  resulting  maximal  connected  component  of  G  is  face- 
connected. 

Proof.  Consider  a  maximal  connected  component  A  remaining  after  removing  5 
from  G.  Assume  for  contradiction  that  the  set  of  5-boundary  vertices  of  A  is 
not  face-connected.  There  must  then  be  at  least  two  distinct  maximal  connected 
components,  call  them  Ft  and  F2,  of  interior  faces  that  contain  boundary  vertices 
(so  F,  U  Ft  is  not  connected).  Let  /,,  *  =  1,2,  be  an  interior  face  in  component  F,. 
and  let  b,  be  a  boundary  vertex  in  face  /,  Since  each  edge  of  G  lies  in  an  interior 
face,  we  can  choose  each  f,  to  contain  a  vertex  of  5  as  well  as  a  boundary  vertex. 

Fact  1  There  is  a  connected  set  I  of  interior  faces,  none  of  which  contains  a  bound¬ 
ary  vertex,  such  that  I  separates  f\  from  f2. 

Verification.  It  is  not  possible  for  both  F\  to  encircle  /j  and  F2  to  encircle  fx,  since 
then  Ft  and  F2  would  intersect  (so  /j  and  /2  would  be  connected  by  interior  faces 
containing  boundary  vertices).  Without  loss  of  generality,  say  that  F\  does  not 
encircle  /2. 

Let  J  be  the  set  of  interior  faces  that  do  not  contain  boundary  vertices  and  that 
are  incident  to  the  outer  boundary  of  Fi  (so  that  /2  is  on  the  outside).  By  definition, 
the  set  J  separates  fx  from  /2.  If  J  is  connected,  then  it  is  the  desired  set  /.  If  J 
is  not  connected,  then  adding  the  exterior  face  of  G  to  J  yields  a  connected  set  J' . 
Moreover,  /2  must  lie  in  one  of  the  simply  connected  regions  J"  of  J' .  Deleting  the 
exterior  face  from  J"  then  yields  the  desired  set  7;  see  Fig.  8. 

Fact,  2  I  contains  a  vertex  of  A  and  a  vertex  of  5. 

Verification.  I  separates  fx  from  /2,  yet:  ft  and  /2  both  contain  vertices  of  both  A 
and  5;  both  A  and  5  are  face-connected  in  G. 

Since  7  contains  vertices  of  A  and  5  and  is  connected,  and  since  5  separates  the 
connected  set  A  from  the  rest  of  G,  the  set  7  must  contain  at  least  one  face  that 
contains  both  a  vertex  from  A  and  a  vertex  from  5.  Such  a  face  must  also  contain 
a  vertex  of  the  5-boundary  of  A,  contradicting  Fact  1.  Lemma  4  follows.  □ 

A  set  of  vertices  5  of  a  graph  K  is  d-quasi-connected,  d  a  positive  integer,  if  for 
every  two  vertices  u,  w  of  5,  there  exists  a  chain  of  vertices 

v  -  v,uvx,v2,...,vk  -  w, 

of  S,  where  consecutive  vertices  v<,  vl+1  are  distance  <  d  apart  in  K . 
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Figure  8:  (a)  The  embedding  f,  with  the  set  J  outlined  boldly;  “x”  marks  /j,  and 
“X”  marks  Fj  with  the  holes  filled  in.  (b)  The  set  J'  =  J  U  (outer  face),  (c)  The 
embedding  e,  as  in  (a),  with  the  set  J"  outlined  boldly. 

Lemma  5  The  Fundamental  Lemma  for  Butterfly-Like  Graphs 
Say  that  there  is  a  subgraph  K  of  G  and  a  constant  c  such  that 

•-  K  is  9(G) -quasi-connected 

•  the  image  of  K  under  the  embedding  p  lies  within  c9(G)f>  consecutive  levels 

of  B(p). 

Then  6  >  a(c  )>%$•  «*«*  a(c)  is  a  constant  depending  only  on  c. 


Proof.  Say  that  the  image  of  H  under  p  lies  entirely  in  levels8 

/  +  1,/  + 2,  ••*,/  +  c$(G)J 

of  B(p).  Let  u  and  v  be  arbitrary  vertices  of  H  which  are  connected  by  a  path  of 
at  most  9(G)  vertices  in  G.  The  image  of  this  path  in  B(p)  must  lie  totally  within 
levels 

l-9(G)6  +  1,  •••,!  + (c-f  l)$(G)6 

of  B(p ),  since  the  embedding  p  has  dilation  6.  See  Fig.  9.  Since  K  is  $(G)-quasi- 
connected,  this  means  that  the  PWL  strings  of  all  images  of  vertices  of  K  can  differ 

8  All  addition  is  modulo  p. 
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Figure  9:  Illustrating  the  Fundamental  Lemma,  with  l  =  c$(G)j  and  h  =  $(G)6: 

Vertices  of  K  reside  in  region  II;  length-#  (G)  paths  between  vertices  of  K  cannot 
extend  beyond  regions  I  or  III. 


only  in  some  set  of  at  most  ((c  +  2)$((7)  +  \)6  bit  positions.  It  follows  that  l\ 
mu  contain  no  more  than  vertices,  i.c.,  rd>((,')b  levels  of 

with  at  most  1  vertices  per  level.  In  other  words. 

c$(G)a2(le+l,*<c)+1,#  >  \K\, 


whence  the  result.  □ 

We  now  complete  the  proof  of  Theorem  3,  beginning  with  two  simple  lemmas. 

Lemma  6  Any  face-connected  set  of  vertices  of  G  is  $(G)-quasi-connectcd. 

Proof  Sketch.  Any  vertex  in  an  / -vertex  face  is  distance  <  ff  2  from  any  neighboring 
face.  D 


Lemma  7  Let  C  be  a  set  of  vertices  of  the  graph  G  whose  removal  partitions  G 
into  connected  components  all  of  size  <  \G\/2.  Then  C  is  a  1/3-2/3  separator  of  G. 


Proof  Sketch.  Remove  C  from  G,  order  the  resulting  connected  components  by  size 
into  decreasing  order,  and  lump  the  components  into  two  piles  as  follows. 

•  Place  the  largest  component  into  the  left  pile. 

•  Place  as  few  of  the  largest  remaining  components  in  the  right  pile  as  possible 
until  the  right  pile  is  bigger  than  the  left. 

•  Now  alternate  piles,  adding  as  few  of  the  largest  remaining  piles  as  possible 
to  the  smaller  pile  until  the  smaller  first  becomes  bigger  than  the  larger  pile. 

Clearly,  when  one  has  completed  the  two  piles,  the  larger  cannot  be  bigger  than  the 
smaller  by  more  than  the  size  of  the  third  largest  component,  i.e.,  by  more  than 
| C* | / 3  vertices.  It  follows  that  each  pile  must  contain  at  least  |G|/3  vertices,  whence 
the  claim.  □ 

Theorem  3  will  now  follow  from  the  next  Lemma. 

Lemma  8  The  embedding  p  must  have  dilation  6  > 
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Proof.  Partition  B(p)  into  bands ,  each  band  /?,  being  a  sequence  of  d,6  consecutive 
levels,  2$(G)  <  d,  <  4$(G),  where  the  constants  d,  may  be  chosen  in  any  way  that 
achieves  a  partition.  Let  k(v),  the  color  of  vertex  v  of  G,  be  the  index  t  of  the  band 
in  which  p(v)  resides. 

We  perform  a  modified  breadth-first  search  of  G,  to  find  a  ♦(G)-quasi-connected 
component  of  size  >  E(G),  all  of  whose  vertices  have  images  in  a  single  band  of 
B(p),  hence  the  same  color.  By  Lemma  5,  the  existence  of  such  a  component  will 
yield  the  lower  bound  on  6. 

The  breadth-first  search  proceeds  as  follows.  We  select  an  arbitrary  vertex  v<, 
of  G  and  form  Vo,  the  maximal  connected  component  of  G  that  contains  v0  and 
that  consists  entirely  of  vertices  with  color  «(vo)-  Since  V0  is  connected,  removing 
its  vertices  partitions  G  into  connected  components;  let  C0  be  the  largest  of  these. 
Lemmas  4  and  6  assure  us  that  the  Vo-boundary,  Bo,  of  the  component  C',>  is  $(G)- 
quasi-connecled.  It  follows  that 

Fart.  3  All  vertices  of  Bn  have  the  same  color. 

Verification.  Since  each  v  €  B0  is  adjacent  to  a  vertex  of  Vo,  we  must  have  «(t>)  <= 
{*(»'„)  -  I.K(t’n)  4  1}.  Moreover,  Bn  cannot  contain  vertices  of  both  colors:  Two 
such  vertices  would  be  separated  by  the  band  /?*(„„),  contradicting  the  fact  that  B„ 
is  <J>(G')-quasi-connected. 

Next,  form  \  the  maximal  monochromatic  subgraph  of  G  that  contains  both  Bn 
and  all  connected  components  of  G  that  intersect  Br,;  obviously,  V,  is  $(G)-quasi- 
connected.  so  removing  it  partitions  G  into  some  number  of  connected  components. 
Lot  C\  be  the  largest  of  these,  and  let  B\  be  the  V, -boundary  of  C\.  As  with  B(l, 
one  shows  that  B,  is  $(6')-quasi-connected  and  monochromatic. 

We  continue  in  this  fashion,  constructing,  in  turn,  for  i  =  2,3, ... ,  the  following 
subgraphs  of  G,  with  the  indicated  properties: 

•  Vx:  the  ($(G)-quasi-connected)  maximal  monochromatic  subgraph  of  G  that 
contains  both  B,_t  and  all  connected  components  of  G  that  intersect  B<_  t 

•  C,:  the  largest  connected  component  of  G  remaining  when  one  removes  V, 
from  G 

•  /?,:  the  ($((7)-quasi-connected,  monochromatic)  F.-boundary  of  C, 

One  continues  this  construction  until  some  subgraph  V,  contains  at  least  D(G) 
vertices.  We  now  show  that  this  point  must  occur. 
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Fact  4  For  some  i,  |V,|  >  E(G). 


Verification.  Note  that  at  each  point  in  our  construction,  V,  is  whittled  out  of  the 
largest  component  C,_j  of  G  remaining  after  removal  of  V,_j  from  G.  Moreover,  V, 
disconnects  the  vertices  of  C,_i  -  V,  from  the  remainder  of  G,  as  one  can  verify  easily 
by  induction  on  i.  At  some  point,  therefore,  the  whittling  process  must  reduce  the 
size  of  the  then-current  largest  component  Cm  so  that  |Cm|  <  |G|/2.  By  Lemma  7, 
the  then-current  ^nisa  1  / 3-2/3  separator  of  G,  hence  must  contain  at  least  E(G') 
vertices. 

The  preceding  development  gives  us  a  set  of  vertices,  of  size  >  E(G),  whose 
images  reside  in  a  single  band  of  4  levels  of  B{p).  By  Lemma  5,  Theorem  3  follows. 

□ 

5.  THE  COROLLARIES:  X-TREES  AND  MESHES 

Corollaries  I  and  2  now  follow  from  the  following  Lemmas. 

Lemma  9  |HR|  S(X(/i))  =  fl (/».)  =  n(log  |A'(/i)|),  and  ♦(X(h))  -  5  (under  the 
natural  embedding). 


Lemma  10  (e.g.,  |HR|)  E(A/(s))  =  fl(s)  =  fl(^|M(s)|),  and  Q(M(s))  =  4  (under 
the  natural  embedding). 

6.  CONCLUDING  REMARKS 

We  close  with  some  remarks  about  extensions  to  the  research  described  here. 

The  lower  bound  of  Theorem  3  cannot  be  improved  in  general,  as  one  can  see 
from  considering  homeomorphs  of  the  mesh. 

Our  lower  bound  for  the  mesh  extends  also  to  higher-dimensional  meshes  and 
to  pyramid  graphs;  thus,  these  are  examples  of  other  popular  networks  that  embed 
efficiently  in  the  Hypercube,  but  not  in  butterfly-like  machines. 

The  lower  bound  of  Theorem  3,  which  deals  explicitly  only  with  embeddings  in 
the  Butterfly,  extends  to  embeddings  in  the  mesh  of  trees,  Cube-Connected-Cycles, 
Benes  network,  and  similar  levelled  networks. 
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We  do  not  yet  have  an  analogue  of  Theorem  3  for  embeddings  in  the  shuffle* 
exchange  and  deBruin  graphs9.  However,  using  rather  complicated  arguments,  we 
can  prove  that  any  expansion-O(l)  embedding  of  the  n-vertex  X-tree  or  the  n-vertex 
mesh  in  these  host  graphs  requires  dilation  (l(loglogn).  Since  a  complete  binary 
tree  is  a  spanning  tree  of  the  deBruijn  graph,  the  proof  technique  of  Section  3  shows 
that  this  lower  bound  for  the  X*tree  is  optimal.  We  suspect  that  the  lower  bound 
for  the  mesh  can  be  improved. 

In  order  to  justify  dilation  fully  as  the  central  measure  of  concern  in  network 
embeddings,  it  would  be  nice  to  strengthen  the  results  of  Section  3  to  show  that 
the  Butterfly  can  simulate  any  graph  having  a  \/2-bifurcator  of  size  5  =  H(log  n) 
with  delay  0(log  5).  We  believe  this  to  be  possible  using  the  arguments  of  Section 
2,  but  we  have  not  worked  through  the  details. 

Lastly,  it  should  be  noted  that  our  lower  bounds  do  not  mean  that  a  Butterfly 
cannot  efficiently  simulate  a  mesh  or  X-tree  efficiently  over  a  large  span  of  time.  For 
example,  a  Butterfly  can  simulate  log  n  steps  of  a  mesh  of  a  constant  fraction  smaller 
size  within  O()og  n  log  log  n)  steps,  and  possibly  within  O(logn)  steps.  Similar 
improvements  in  amortized  simulation  times  are  also  possible  for  the  X-tree.  and 
we  are  currently  studying  how  good  such  amortized  simulations  can  be  in  general. 
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